[shardformer] fix gathering output when using tensor parallelism #5431
+32
−13
We went looking everywhere, but couldn’t find those commits.
Sometimes commits can disappear after a force-push. Head back to the latest changes here.